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Abstract 

The well-founded semantics is one of the most widely studied and used semantics of logic 
programs with negation. In the case of finite propositional programs, it can be computed 
in polynomial time, more specifically, in 0(\At(P)\ x size(P)) steps, where size(P) denotes 
the total number of occurrences of atoms in a logic program P. This bound is achieved by 
an algorithm introduced by Van Gelder and known as the alternating-fixpoint algorithm. 
Improving on the alternating-fixpoint algorithm turned out to be difficult. In this paper 
we study extensions and modifications of the alternating-fixpoint approach. We then re- 
strict our attention to the class of programs whose rules have no more than one positive 
occurrence of an atom in their bodies. For programs in that class we propose a new im- 
plementation of the alternating-fixpoint method in which false atoms are computed in a 
top-down fashion. We show that our algorithm is faster than other known algorithms and 
that for a wide class of programs it is linear and so, asymptotically optimal. 



1 Introduction 



The well-founded semantics was introduced in (Van Gelder et ai, 1991) to provide 
3-valued interpretations to logic programs with negation. Since its introduction, 
the well-founded semantics has become one of the most widely studied and most 



commonly accepted approaches to negation in logic programming (Alferes et 



1995 



Fitting, 1991; Chen et ai, 1995; Chen fc Warren, 1996; Zukowski et ai, 1997 



Brass & Dix, 1998). It was implemented in several top-down reasoning systems, 



most prominent of which is XSB (Rao et ai, 1997) 



The well-founded semantics is closely related to the stable-model semantics (Gel- 



fond & Lifschitz, 1988), another major approach to logic programs with negation. 



The well-founded semantics approximates the stable- model semantics (Van Gelder 



et ai, 1991; Fitting, 2001). Moreover, computing the well-founded model of proposi- 



tional programs is polynomial (Van Gelder, 1989) while computing stable models is 



NP-hard (Marek & Truszczynski, 1991). Consequently, evaluating the well-founded 



semantics can be used as an effective preprocessing technique in algorithms to com- 



pute stable models ( Subrahmanian et al., 1995 ). In addition, as demonstrated by 
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smodels (Niemela & Simons, 1996), at present the most advanced and most effi- 
cient system to compute stable models of DATALOG^ programs, the well-founded 
semantics can be used as a powerful lookahead mechanism. 

Despite the importance of the well-founded semantics, the question of how fast it 



can be computed has not attracted significant attention. Van Geldcr (Van Gelder 



1989) described the so called alternating-fixpoint algorithm. Van Gelder's algorithm 



runs in time 0(\At(P)\ x size(P)), where At(P) is the set of atoms occurring in a 
logic program P, \At(P)\ denotes the cardinality of At(P), and size(P) is the size of 
P (the total number of atom occurrences in P). Improving on this algorithm turned 
out to be difficult. The first progress was obtained in ( Bcrman et ai, 1995 ). The 
algorithm described there, when restricted to programs whose rules contain at most 
two positive occurrences of atoms in their bodies, runs in time 0(|j4i(P)| 4 / 3 |P| 2 / 3 + 
size(P)), where \P\ is the number of rules in P. For programs whose rules have no 
more than one positive atom in the body a better estimate of 0(|j4i(P)| 3 ' 2 |P| :l ' 2 + 
size(P)) was obtained. For some classes of programs this is an asymptotically better 
estimate than the 0(\At(P)\ x size(P)) estimate that holds for the algorithm by 
Van Gelder. 

A different approach to computing the well-founded model was proposed in 



(Zukowski et ai, 1997; Brass et ai, 2001). It is based on the notion of a program 



transformation (Brass & Dix, 199S). The authors describe there several transfor- 
mations that can be implemented in linear time and that simplify a program while 
(essentially) preserving the well-founded semantics. These transformations are: the 
positive reduction, success, negative reduction, and failure (PSNF transformations, 
for short). They allow one to compute in linear time the Kripke-Kleene seman- 
tics (Fitting, 1985) of the program. To compute the well-founded semantics one 
also needs to detect the so-called positive loops. The complexity of this task dom- 
inates the asymptotic complexity of the well-founded semantics computation. No 



improved algorithms for the positive- loop detection are offered in ( Brass et al. 



2001) so the worst-case asymptotic complexity of the algorithm presented there 



remains the same as that of the alternating-fixpoint method. However, due to the 
use of PSNF transformations, that simplify the program, the algorithm based on 
program transformations may in practice run faster. In contrast to the approach 
studied in ( Brass et ai, 2001 ), we focus here on the positive- loop detection task. 

The alternating-fixpoint algorithm works by successively improving lower ap- 
proximations T and F to the sets of atoms that are true and false (under the 
well-founded semantics), respectively. The algorithm starts with T = 0. Using this 
estimate, it computes the first estimate for F. Next, using this estimate, in turn, 
it computes a better estimate for T. The algorithm continues until further im- 
provements are not possible. It returns the final sets T and F as the well-founded 
semantics. A dual version of the alternating-fixpoint algorithm, starting with F = 
and then alternatingly computing approximations to T and P, is also possible. The 
most time-consuming part of this algorithm is in computing estimates to the set of 
atoms that are false (in this part, in particular, positive loops are detected). In the 
Van Gelder algorithm, the best possible approximation (given the current estimate 
for T) is always computed by using a bottom-up approach. 
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In this paper we focus on the problem of detecting positive loops and computing 
new false atoms. We restrict our attention to the class of programs that have at 
most one positive atom in the body. We denote this class of programs by CP\ . We 
show that for programs from CP\, false atoms can be computed by means of a top- 
down approach by finding atoms that do not have a proof. Moreover, we show that 
it is not necessary to find all atoms that can be established to be false at a given 
stage. Finding a proper subset (as long as it is not empty) is also sufficient and 
results in a correct algorithm. We apply these techniques to design a version of an 
alternating-fixpoint algorithm computing the well-founded semantics of programs 
from the class CV\. We demonstrate that the resulting algorithm is asymptotically 
better than the original alternating-fixpoint algorithm by Van Gelder. Specifically, 
we show that our algorithm runs in time 0(\At(P)\ 2 + size(P)). Thus, for programs 
with size(P) > \At(P)\ 2 , our algorithm runs in linear time and is asymptotically 
optimal! It is also easy to see that when \P\ > \At(P)\, the asymptotic estimate of 
the running time of our algorithm is better than that of algorithms by Van Gelder 
( Van Gelder, 1989 ) and Berman et al. (Berman et al, 1995| ). 

As mentioned above, our approach is restricted to the class CP\. Applicability 
of our method can, however, be slightly extended. Let us denote by CPf the class 
of these logic programs that, after simplifying by means of PSNF transformations 
(or, equivalently, with respect to the Kripke-Kleene semantics) fall into the class 
CP\. Since PSNF transformations (the Kripke-Kleene semantics) can be computed 
in linear time, the asymptotic estimate of the running time of our method extends 
to all programs in the class LV\ . 

The paper is organized as follows. In the next section we provide a brief review 
of the key notions and terminology. In Section |3| we describe several modifications 
to the original Van Gelder algorithm, we show their correctness and estimate their 
running time. The ultimate effect of our considerations there is a general template 
for an algorithm to compute the well-founded semantics. Any algorithm computing 
some (not necessarily all) atoms that can be established as false given a current 
estimate to the well-founded can be used with it. One such algorithm, for programs 
from the class CfP\ 1 is described and analyzed in Section^. It constitutes the main 
contribution of the paper and yields a new, currently asymptotically most efficient 
algorithm for computing the well-founded semantics for programs in CV\. The last 
section contains conclusions. 



2 Preliminaries 

We start by reviewing basic concepts and notation related to logic programs and 
the well-founded semantics, as well as some simple auxiliary results. In the paper 
we consider the propositional case only. 

Let P be a normal logic program. By At(P) we denote the set of atoms occurring 
in P. Let M C At(P) (throughout the paper we often drop a reference to P from our 
notation, whenever there is no danger of ambiguity) . By Pm we denote the program 
obtained from P by removing all rules whose bodies contain negated literals of 
the form not (a), where a S M. Further, by P h we denote the program obtained 



4 



Z. Lone and M. Truszczynski 



from P by removing from the bodies of its rules all negative literals. Clearly, the 
program {Pm) H coincides with the Gelfond-Lifschitz reduct of P with respect to M 
(throughout the paper, we write Pj^ for (Pm) h , to simplify notation). The Gelfond- 
Lifschitz operator on the algebra of all subsets of At, GL (following our convention, 
we omit the reference to P from the notation), is defined by 

GL(M) = LM(P^), 

where LM(Q) stands for a least model of a Horn program Q. 

We now present characterizations of the well-founded semantics. We phrase them 
in the language of operators and their fixpoints. All operators considered here are 
defined on the algebra of subsets of At{P). We denote a least fixpoint (if it exists) 
of an operator O by lfp(0). 

It is well known that GL is antimonotone. Consequently GL 2 = GL o GL is 
monotone and has a least fixpoint. The set of atoms that are true with respect to 
the well-founded semantics of a program P, denoted by T w j s , is precisely the least 



fixpoint of the operator GL 2 , that is, T w f s = lfp(GL 2 ) ( |Van Gelder, 1989| ; [Fitting 



2001). The set of atoms that are false with respect to the well-founded semantics 



of a program P, denoted by F w f s , is given by GL(T w f s ) (throughout the paper, X 
denotes the complement of a set X with respect to At(P)). 
One can define a dual operator to GL 2 by 



A(M) = GL(GL(M)). 

It is easy to see that A is monotone and that its least fixpoint is F w f s . Thus, 
F wfs = lfp(A) and T wfs = GL(F wfs ). 

We close this section by discussing ways to compute GL(M) for a given finite 
propositional logic program P and a set of atoms M C At(P). A straightforward 
approach is to compute the Gelfond-Lifschitz reduct P^ and then to compute its 
least model. The resulting algorithm is asymptotically optimal as it runs in time 
linear in the size of the program. However, in this paper we will use a different 
approach, more appropriate for the computation of the well-founded semantics. Let 
P be a logic program with negation. We define At~(P) = {not(a): a 6 At(P)}. For 
every set M C At(P) U At~(P), we define true(M) = Mfl At(P). If we interpret 
literals of At~(P) as new atoms, then for every set M C At(P), the program 
P U not(M) can be viewed as a Horn program. Thus, it has a least model. It is 
easy to see that 

GLp(M) = true(LM(P U not(M))). 

Here, P appearing at the left-hand side of the equation stands for the original 
logic program, while P appearing at the right-hand side of the equation stands for 
the same program but interpreted as a Horn program. Thus, using the algorithm 
of Dowling and Gallier ( Dowling fc Gallier, 1984] ), the Gelfond-Lifschitz reduct 



can be computed in time 0(size(P) + \M\) = 0(size(P)) (since M C At{P), 
\M\ = 0{size{P))). 
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3 Algorithms 

The departure point for our discussion of algorithms to compute the well-founded 
semantics is the alternating- fixpoint algorithm of Van Gelder (Van Gelder, 1989). 
Using the terminology introduced in the previous section it can be formulated as 
follows. 



Algorithm 1 (Van Gelder) 

F:= 0; 
repeat 

T := true(LM(P U not(F)); (* or equivalently: T := GL(F); *) 

F := LM(P%); (* or equivalently: GL(T); *) 
until no change in F; 
return T and F. 

Let F' and F" be the values of the set F just before and just after an iteration 
of the repeat loop in Algorithm 1. Clearly, 

F" = GL(GL{F T )) = A(F'). 

Thus, after iteration i of the repeat loop, F = A l (%). Consequently, it follows from 
our earlier remarks that when Algorithm 1 terminates, the set F that is returned 
satisfies F — F w f s . Since there is no change in F in the last iteration, when the 
algorithm terminates, we have T = T w f s . That is, Algorithm 1 is correct. 

We will now modify Algorithm 1. The basis for Algorithm 1 is the operator A. 
This operator is not progressive. That is, M is not necessarily a subset of A(M). 
We will now introduce a related progressive operator, say B, and show that it can 
be used to replace A. Let P be a logic program and let T and F be two subsets of 
At(P). By Pf,t we denote the program obtained from P by removing 

1. all rules whose heads are in F 

2. all rules whose bodies contain a positive occurrence of an atom from F 

3. all rules whose bodies contain a negated literal of the form not (a), where 
a G T. 

Clearly, P F . T C P T . 

We define an operator B(F) as follows: 

B(F) = LM(P* T ), 

where T = GL(F) and Pp T abbreviates (Pf,t) H - The following result gathers key 
properties of the operator B. 

Theorem 3.1 

Let P be a normal logic program. Then: 

1. B is monotone 

2. For every F C At(P), A(F) C B(F) 

3. For every F C F wfs , B{F) C F wfs 
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4. Ifp(B) = F wfs 

5. For every F C At(P), B(F) =FU(F\ LM(P£ T )), where T = GL(F). 

Proof: ([[]) Assume that Fi C P 2 . Set T 4 = GL{F\), i = 1, 2. Clearly, pJ C f\ and, 
by antimonotonicity of GL, Ti C T 2 . By the definition of Ppy, Pf 2 ,t 2 5= Pfi,Ti- 
Consequently, LM{P^ T ^) C LM(P£ iTi ) and, so, B(Pi) C S(V 2 ). 
(|) Let T = GL{F). Clearly, P F . T C P T . Thus, A(P) = LM(P%) C LM(P$ T ) = 
B(F). 

(||) We have, LM(PJ£ . ) = -F M / S . It follows that removing from , rules with 
heads in and those that contain an atom from F w f s in their bodies does not 
change the least model. That is, 

LM(P^ T J = LM(Pl fs ). 



Since, T wfs = GL(F wfs ), B(F wfs ) = LM(P^ Twfs ). Let F C F wfs . Then, by © 
B(-F) C B(F wfs ). Thus, we have 



P(F) C S(F W/S ) = LM(P£ wfs T J = LM(P£J = F wfs . 

d) The least fixpoint of S is given by = [j B l (9). By (|), C F wfs . On 

the other hand, by @) and (|), A l {%) C P l (0). Thus, = C It 

follows that lfp(B) — F w f s . 

(§) Let T = GL{F). Since P F , T has no rules with head in P, LM{P^ T ) C P and, 
consequently, P C B(F). Thus, the assertion follows. □ 
Theorem 3.1 allows us to prove the correctness of the following modification of 
Algorithm 1. 



Algorithm 2 

P:=0; 
repeat 

T := true(LM(PU not(P)); 

AP:=P\LM(P£ T ); 

P :=PU AP; 
until no change in P; 
return T and P. 



By Theorem 3.1, each iteration of the repeat loop computes B(F) as the new 
value for the set P. More formally, the set P just after iteration i, satisfies P = 
B l (%). Thus, when the algorithm terminates, the set P that is returned is the least 



fixpoint of B. Consequently, by Theorem 3.1 ( 4b, Algorithm 2 is correct 



We will now modify Algorithm 2 to obtain a general template for an alternating- 
fixpoint algorithm to compute the well-founded semantics. The key idea is to ob- 
serve that it is enough to compute a subset of AP in each iteration and the algorithm 
remains correct. 

Let us assume that for some operator A w defined for pairs (F,Q), where P C 
At(P) and Q is a Horn program such that At(Q) C P (the complement is, as always, 
evaluated with respect to At(P)), we have: 
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(Wl) A W (F,Q)CF\LM(Q) 

(W2) A W (F, Q) = if and only if F \ LM(Q) = 0. 

Let F C Ai(P). By the definition of P F<T , At(P£ T ) C F. Thus, we define 
B W (F) = F U A W (F,P£ T ), where T = true(LM(P U not(F))). It is clear that 
for every F C ^i(F), F C B W (F) C F(F), the latter inclusion follows from Theo- 
rem 3T(5|) and (Wl). Consequently, for every i, 



F/ (0) C B'(0). 

It follows that B l w ($) C lfp(B) — F w f s . It also follows that there is the first i such 
that F* u (0) = F^ +1 (0). Let us denote this set F*„(0) by F . Then F C F w/s . In the 
same time, by condition (W2), B(Fq) — Fq. Since F w f s is the least fixpoint of B, 
F w f s C Jig. It follows that a modification of Algorithm 2 in which line 

AF := F \ LM(Pp T ); 

is replaced by 

AF := A W (F, Ppj>); 

correctly computes the well-founded semantics of a program P. Thus, we obtain 
the following algorithm for computing the well-founded semantics. 



Algorithm 3 

F:=0; 
repeat 

T := true(LM(PU not(F)); 

AF:= A W (F,P^ T ); 

F :=FU AF; 
until no change in F; 
return T and F. 

We will now refine Algorithm 3. Specifically, we will show that the sets T and F 
can be computed incrementally. 

Let R be a Horn program. We define the residual program of R, res(R), to be the 
Horn program obtained from R by removing all rules of R with the head in LM(R) 
and by removing from the bodies of the remaining rules those elements that are in 
LM(R). We have the following technical result. 

Lemma 3.2 

Let R be a Horn program and let M be a set of atoms such that M fl head(R) = 0. 
Then LM(R U M) = LM(R) U LM(res(R) U M). □ 

Lemma |3.2| implies that (we treat here negated literals as new atoms and P as 
Horn program over the extended alphabet) 

LM(P U not(F U AF)) = LM(P U not(F)) U LM(res(P U not(F)) U not(AF)). 

Thus, if the set F is expanded by new elements from AF, then the new set T can 
be computed by increasing the old set T by AT = true(LM(res(P U not(F)) U 
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not(AT))). Important thing to note is that the increment AT can be computed 
on the basis of the residual program and the increment AT. Similarly, we have 

Pfuaf,tuat = (Pf.t)af,at- 

Thus, computing Pf,t can also be done incrementally on the basis of the program 
considered in the previous iteration by taking into account most recently computed 
increments AT and AT. 

This discussion implies that Algorithm 3 can be equivalently restated as follows: 



Algorithm 3 

1 T := T := AT := AT := 0; 

2 R:= P; (*R will be treated as a Horn program *) 

3 Q:=P; 

4 repeat 

5 AT := true(LM{R U not(AT)); 

6 i?:=res(i?Unot(AT)); 

7 T := T U AT; 



Q 



{AF,ATi 



9 AT := A W (F, Q h ); 

10 T := T U AT; 

11 until no change in T; 

12 return T and T. 

We will now estimate the running time of Algorithm 3. Clearly line 1 requires 
constant time. Setting up appropriate data structures for programs R and Q (lines 
2 and 3) takes 0(size(P)) steps. In each iteration, AT is computed and the cur- 
rent program R is replaced by the program res(R U not (AT)) (lines 5 and 6). By 



modifying the algorithm from ( Dowling fc Gallier, 1984 ) and assuming that R is 



already stored in the memory (it is avaliable either as the result of the initialization 
in the case of the first iteration or as a result of the computation in the previous 
iteration), both tasks can be accomplished in 0(size(R°) + |AT| — size(R n )) steps. 
Here R° denotes the old version of R and R n denotes the new version of R. Con- 
sequently, the total time needed for lines 5 and 6 over all iterations is given by 
0(size(P) + \At(P)\ — size(R t )) = 0(size(P)) (where R { is the program R, when 
the algorithm terminates). The time needed for all lines 7 is proportional to the 
number of iterations and is 0(\At{P)\) = 0(size(P)). 

Given a logic program Q and sets of atoms AT and AT, it takes 0(size(Q) — 
size(QAF,AT) + |AT| + |AT|) steps to compute the program Qaf,at in line 8. We 
assume here that Q is already in the memory as a result of the initialization in 
the case of the first iteration, or as the result of the computation in the previous 
iteration, otherwise. It follows that the total time over all iterations needed to 
execute line 8 is 0(size(P) + \At(P)\) = 0(size(P)). 

Thus, we obtain that the running time of Algorithm 3 is given by 0(size(P)+m), 
where m is the total time needed to compute A W (F, Q h ) over all iterations of the 
algorithm. 
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In the standard (Van Gelder's) implementation of Algorithm 3, we compute the 
whole set F \ LM{Q h ) as A w (F,Q h ). In addition, computation is performed in a 
bottom-up fashion. That is, we first compute the least model of Q h and then its 
complement with respect to F. Such approach requires 0(size(Q h )) = 0(size(P)) 
steps per iteration to execute line 9 and leads to 0(\At(P)\ x size(P)) running-time 
estimate for the alternating-fixpoint algorithm. 



4 Procedure A w 

In this section we will focus on the class of programs, £P\, that is, programs whose 
rules have no more than one positive atom in their bodies. We assume that we have 
a procedure false that, given a Horn program Q G CVi, returns a subset of the 
set At(Q) \ LM(Q). We also assume that false returns the empty set if and only if 
At(Q) = LM(Q). For every pair (F, Q), where F C At(P) and Q is a Horn program 
such that At(Q) C F, we define 

A w (F,Q)=false(Q). 

It is easy to see that this operator A W (F,Q) satisfies conditions (Wl) and (W2). 
Consequently, it can be used in Algorithm 3. Clearly, the procedure A w and its 
computational properties are determined by the procedure false. In the remainder 
of the paper, we will describe a particular implementation of the procedure false 
and estimate its running time. We will use this estimate to obtain a bound on the 
running time of the resulting version of Algorithm 3. 

A straightforward way to compute the least model of Q and so, to find At(Q) \ 
LM(Q), is "bottom-up". That is, we start with atoms which are heads of rules 
with the empty bodies and use the rules of Q to compute all atoms in LM(Q) 
by iterating the van Emden-Kowalski operator. An efficient implementation of the 



process is provided by the Dowling-Gallier algorithm ( Dowling fc Gallier, 1984 ). 

The approach we follow here in the procedure false is "top-down" and gives us, 
in general, only a part of the set At(Q) \ LM(Q). More precisely, for an atom a we 
proceed "backwards" attempting to construct a proof or to demonstrate that no 
proof exists. In the process, we either go back to an atom that is the head of a rule 
with empty body or we show that no proof exists. In the former case, a 6 LM (Q). 
In the latter one, none of the atoms considered while searching for a proof of a 
are in LM(Q) (because Q € CP\ and each rule has at most one antecedent). The 
problem is that we may find an atom a that does not have a proof only after we 
look at all other atoms first. Thus, in the worst case, finding one new false atom 
may require time that is proportional to the size of Q. 

To improve the time performance, we look for proofs simultaneously for all atoms 
and grow the proofs "backwards" in a carefully controlled way. Namely, we never 
let one search to get too much ahead of the other searches. This controlled way of 
looking for proofs is the key idea of our approach and leads to a better performance. 
We will now provide an informal description of the procedure false followed later 
by a formal specification and an example. 

In the procedure, we make use of a new atom, say s, different from all atoms 
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occurring in Q. Further, we denote by head(r) the atom in the head of a rule r £ Q 
and by tail(r) the atom which is either the unique positive atom in the body of 
r, if such an atom exists, or s otherwise. We call an atom a £ At(Q) accessible if 
there are rules fi, . . . , rjt in Q such that tail(ri+i) = head(ri), for i = 1, . . . , k — 1, 
tail(r\) = s and head(rk) = a. Clearly, the least model LM(Q) of Q is precisely 
the set of all accessible atoms. 

In each step of the algorithm, the set of atoms from At(Q) is partitioned into 
potentially false sets or pf-sets, for short. We say that a set v C At(Q) is a pf-set 
if for each pair of distinct atoms a,b £ v there are rules ri, . . . , in Q such that 
tail(ri + i) = head(ri) £ u, for i = 1, . . . , k — 1, tail{r\) = b and head(rk) — a. It 
is clear that if w is a pf-set then either all its elements are accessible (belong to 
the least model of Q) or none of them does (they are all false). Clearly, singleton 
sets consisting of individual atoms in At{Q) are pf-sets. In the algorithm, with each 
pf-set we maintain its cardinality. 

Current information about the state of all top-down searches and about the 
dependencies among atoms, that were discovered so far, is maintained in a directed 
graph Q. The vertex set of this graph, say <S, consists of {s} and of a family of pf- 
sets forming a partition of the set At(Q). The edges of Q are specified by a partial 
function pred : S — > S. We write pred(v) = undefined if pred is undefined for v. 
Thus, the set of edges of Q is given by {(pred(v), v): pred(v) ^ undefined}. Since 
pred is a partial function, it is easy to see that the connected components of the 
graph Q are unicyclic graphs or trees rooted in those vertices v for which pred(v) 
is undefined. Throughout the algorithm we always have pred({s}) = undefined. 
Thus, the connected component of Q containing {s} is always a tree and {s} is its 
root. 

If w and v are two different pf-sets, the existence of the edge (w, v) in Q means 
that we have already discovered a rule in the original program whose head is in v 
and whose tail is in w. Thus, if vertices in w are accessible, then so are the vertices 
in v. A pf-set that is the root of a tree forming a component of Q is called an active 
pf-set. If v is an active pf-set then no rule r with head(r) £ v and tail(r) £ v 
has been detected so far. Thus, v is a candidate for a set of atoms which does not 
intersect the least model of Q. Let us note that even though {s} is a root of a tree 
in Q it is never active as it is not a pf-set in the first place. 

We let active pf-sets grow by gluing them with other pf-sets. However, we allow 
to grow only these active pf-sets whose cardinalities are the least. In each iteration 
of the algorithm the value of the variable size is a lower bound for the cardinalities 
of active pf-sets. To grow an active pf-set v, we look for rules with heads in v and 
with tails in pf-sets other than v (not necessarily active) or in {s}. The dependencies 
between pf-sets discovered in this way are represented as new directed edges in Q. 
Pf-sets that appear in the same cycle are glued together (in the procedure cycle). 
Since {s} is not an active pf-set, it never becomes an element of a cycle in Q. 

If, when attempting to grow a pf-set v we discover a rule with head in v and with 
the tail in a vertex of the tree of Q rooted in {s}, then v is from now on ignored 
(all its vertices belong to the least model of Q). Indeed, v gets connected to a tree 
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of Q rooted in {s}. Consequently, it cannot become a member of a cycle in Q in the 
future and is never again considered by the procedure cycle. 

The main loop (lines 6-23) of the algorithm false below starts by incrementing 
size followed by a call to the procedure cycle(S, pred, size, L). This procedure scans 
the graph Q and identifies all its cycles. It then modifies Q by considering each cycle 
and by gluing its pf-sets into a single pf-set. To this end, it modifies the vertex set 
S of G and the function pred defining the edges of Q. Each such new pf-set becomes 
the root of its tree in Q and so, it becomes active. The procedure cycle computes the 
cardinality of each new active pf-set. Finally, it creates a list L so that it consists 
of active pf-sets of cardinality size. If no such set is found (L is empty), we move 
on to the next iteration of the main loop and increment size by 1. We give a more 
detailed description of the procedure cycle later in the paper when we analyze the 
time complexity of our method. 

For each active pf-set v € L we consider the tail of each rule with head in v (lines 
9-22). If there is a rule r with head(r) e v and tail(r) £ v then it is detected (line 
15). The value pred(v) is set to this element in S that contains tail(r) (it may be 
that this set is {s}). We also set the variable success to true (line 16). The pf-set 
v stops to be active. We move on to the next active pf-set on L. 

If such a rule r does not exist then success = false and v is a set of cardinality 
size consisting of atoms which are not in the least model of Q. This set is returned 
by the procedure false (line 21). Hence, for an active pf-set considered in the loop 
6-23, either we find a pf-set pred(v) s S \ {v} (and we have to consider the next 
pf-set on L) or v is returned as a set of atoms which are not in the least model of 
Q (and the procedure false terminates). Thus, the procedure false is completed if 
either a nonempty set v of atoms which are not in the least model of Q is found or, 
after some passes of the loop 6-23, the graph Q has no active pf-sets. In the latter 
case Q is a tree with the root in {s}. Thus, At(Q) = LM(Q) and v = is returned 
(line 24). 

In the procedure false, as formally described below, an input program Q is rep- 
resented by lists IN (a), a 6 At(Q), of all atoms b such that b is the body of some 
rule with the head a. If there is a rule with the head a and empty body, we insert 
s into the list IN (a). 

We also use an operation next on lists and elements. Let I be a list and w be an 
element, either belonging to I or having a special value undefined. Then 



The value undefined should not be mixed with nil which indicates the end of a 
list. 

Finally, we use a procedure findset(w, S) which, for an atom w and a collection 
S of disjoint sets, one of which contains w, finds the name of the set in S con- 
taining w (it follows from our assumptions that such a set is unique). Elements of 
S are maintained as linked lists. Each element on such a list has a pointer to the 
head of the list. The head serves as the identifier for the list. When the procedure 
findset(w, S) is called, it returns the head of the list to which w belongs. 




the next element after w in I 



if w el 

if w is undefined. 
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1 procedure false(Q); 

2 S := {{x} : x E At(Q)} U {{s}}; 

3 for v G S do pred{v) := undefined; 

4 for x £ At(Q) do := undefined; cardinality(x) := 1}; 

5 size := 0; 

6 while size < \At(Q)\ do 

7 {size := size + 1; 

8 cycle(S,pred,size,L); 

9 for all v G L do 

10 {success := false; 

11 u :— next(u, v); 

12 while u =^ nil and not success do 

13 w(u) := next(w(u), IN(u)); 

14 while u>(u) ^ nil and not success do 

15 {if findset(w(u), S) ^ w 

16 then {success := true; pred(v) := finds et(w(u),S)} 

17 else w(u) := next(w(u), IN(u)) 

18 end while (14)}; 

19 if not success then u := next(u,v) 

20 end while (12)}; 

21 if not success then return v (* the procedure terminates *) 

22 end for (9)} 

23 end while (6)}; 

24 return v = 

25 end false; 



We will now illustrate the operation of the algorithm. Let us consider the following 
Horn logic program Q: 

a <— b <— a a <— c c^a a e <i «— e 

/ <- d e <- / d <- / e^5 g <- j j <- g 
i <— j j <— /i fc <— j fc <— /i /i <— fc 

This program is represented as a graph, , in Fig. [I]. The vertices of this graph 
correspond to the atoms of the program. In addition, G"^ has an auxiliary vertex 
s G" ^4t(Q). An edge (x,y), where x,y G ^4i(Q), represents the clause y <— x from 
Q. An edge (s, y), where t/ G At(Q), represents the clause y <— . When illustrating 
the algorithm, we assume that atoms from At(Q) (atoms a, . . . , k in our example) 
appear on the lists IN(x), x G At(Q), in the alphabetical order. We also assume 
that whenever s belongs to a list IN(x), it appears as the first atom on the list. 

In the algorithm false, the current state of knowledge about the possibility of 
proving an atom from Q is represented by the graph Q. Initially, Q consists of 
isolated vertices. Indeed, line 3 of the algorithm sets pred(x) to undefined, for every 
vertex x of S (see Fig. ^ (left)). All of the vertices of Q, except for {s} are active 
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Fig. 1. Graph representing program Q. 

pf-sets. The procedure cycle (line 8), called with size = 1, puts all of them on the 
list L. 

The algorithm considers next (line 9) all elements on the list L, that is, all 
vertices of Q that are active pf-sets and have cardinality equal to size. During the 
first iteration of the loop 6-23, L consists of all vertices of G, except for {s} (that 
is, singleton sets {x}, where x <E At(Q) = V(G < *) \ {s}). For each vertex v of Q on 
L, the algorithm looks for a back rule for v, that is, a rule in Q with the head in 
v and the tail in a pf-set other than v or in {s}. In our graphical representation of 
Q by means of the graph , a back rule for v corresponds to an edge (referred to 
as a back edge) in G® with the head in v and the tail in a vertex of Q other than v 
(possibly in {s}). To find a back rule (edge) for v, all atoms u of Q (equivalently, 
all vertices u of G®) that belong to v are considered (the loop 12-20). For each such 
atom u, the algorithm searches for the first atom on the list IN(u) that does not 
belong to v. Let us recall that IN(u) is the list of atoms that are the tails of rules 
with the head u or, in the terms of the graph , that are the tails of edges with 
the head u. If such an atom is found, together with u it determines a back rule 
(edge) r for v. The algorithm sets pred(v) to be equal to the pf-set containing the 
tail of r (line 16). That is, an edge from pred(v) to v is added to Q. The algorithm 
moves then on to the next element of the list L. 

In our example, in the first iteration of the loop 6-23, a back rule is found for 
every element on L, that is, for every vertex of Q other than {s}. For instance, for 
the vertex {d}, the algorithm considers atoms on the list IN(d) — (e, /) (let us 
recall that atoms on lists IN(x) are arranged alphabetically with the exception of 
the special atom s which, if present on a list, is always its first element). The first 
atom on the list, e does not belong to {d}. Thus, it defines, together with d a back 
rule for {d}, d <— e. The resulting graph Q is shown in Fig. on the right. 

Let us note that when scanning the list IN(d) in subsequent iterations the algo- 
rithm resumes the scan with the first atom that has not been looked at yet (cf. the 
definition of the operation next). Thus, the next time d is considered as an element 
of an active pf-set for which a back rule is searched for, the scan of IN(d) will start 
with /. The same holds true for all lists IN(x), x s At(Q). Consequently, each atom 
on each of these lists is considered just once. Such an approach still guarantees that 
finding back rules works correctly (that is, that they are found by the algorithm 
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Fig. 2. Graph £/ initially (left side) and after the first iteration of the loop 6-23 (on 
the right). 



whenever they exist). Indeed, when an atom on a list IN(x) is considered, it cither 
defines a back rule with the head x (and, thus, cannot define any new back rule 
with the head x in the future) or it is in the same active pf-set as x (and, thus, it 
neither defines a back rule now nor it will define it in the future, as it will remain 
in the same pf-set as x till the algorithm terminates). 

The second iteration of the loop 6-23 starts with the procedure cycle contracting 
each cycle in the graph Q to a single vertex. The resulting graph is shown in Fig. [| 
on the left. The procedure cycle then creates a new list L. It consists of all active 
pf-sets of cardinality 2. In our case, L contains {g,j} and {h,k} ({d, e,/} is also 
active but has cardinality 3). 

Continuing with the second iteration, the algorithm next considers each vertex 
on L (the loop 9-22) and looks for back rules. In this iteration, a back rule is found 
for each of the nodes on L and the modified graph Q is given in Fig. [| on the right. 



{c} 



(«}| \d, 



Is) 



{/} 



ej] {gj} {h,k} 



{b} {a}'< {d,ej} {gj} 

O 



Fig. 3. Graph Q after the execution of the procedure cycle in the second iteration 
of the loop 6-23 (left) and after the second iteration of the loop 6-23 (right). 



In the third iteration, the procedure cycle contracts the only cycle in Q to a 
single active pf-set of cardinality 4 (Figure || left side). It also creates a new list 
L. This time it consists of active pf-sets of cardinality 3. There is just one such set 
- {d, e,/}. Subsequently, the algorithm false looks for a back rule for {d,e,f}. It 
starts by considering edges ending in d (line 11; we assume that v is represented 
by the list (d, e, /)). It scans the list IN(d) starting at the first atom that has not 
been inspected so far, that is, /. However, since / belongs to the same pf-set as d, 
f does not specify a back rule. Since there are no more atoms on the list IN(d), 
we move on to the next iteration of the loop 12-20 and consider atom e. We have 
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IN(e) = (/, g). Since / was already considered (and yielded a back rule for {e}) 
in the first iteration, we consider g. Since g {d,e,f}, it defines a back rule for 
{d,e,f}, d<- g. 



\b) {«}| {d,ej} 



is I 



{>•} 



{i} 



\g,hj,k] 



{b} {d,ej} {g,hj,k} 

Is) 



Fig. 4. Graph Q after the execution of the procedure cycle in the third iteration of 
the loop 6-23 (left) and after the third iteration of the loop 6-23 (right). 



The resulting graph Q is shown in Figure [| (on the right). It has no cycles. So, the 
only thing done by the procedure cycle in the iteration 4 is that it puts on L active 
pf-scts of cardinality 4. There is just one such set in Q, {g, h, j, k}. The algorithm 
false looks for a back edge for {<?, h, j, k} and does not find any. The variable success 
remains false. The algorithm returns {g,h,j,k} and terminates (line 21). Let us 
note that this set is a proper subset of the set At(Q) \ LM(Q). 

The following theorem formally establishes two key properties of the procedure 
false. 

Theorem 4-1 

1. The procedure false returns a set v such that v C At(Q) \ LM(Q). 

2. false returns the empty set if and only if At(Q) \ LM(Q) = 0. 

Proof: (1) The statement is trivially true if false returns the empty set. Thus assume 
that the returned set v ^ 0. It means that the value of the variable success is false 
after all passes of the loop 12-20 for some active pf-set v in the list L. Thus every 
rule in Q with the head in v has been considered. 

Suppose there is a rule r in Q with head(r) — u G v and tail(r) — b $ v. This rule 
was considered by the procedure false when u = head(r) was a member of some 
active pf-set, say y. Since larger pf-sets are obtained by gluing smaller ones, y C u. 
While r was being considered, the value of w(u) in the loop 14-18 was b and the 
value of v was y. Consequently, findset(b, S) ^ y in line 15 because y C v and b ^ v 
so b $ y. Hence the value of success was set to true and pred(y) was defined to 
be, say, z = findset(b,S) in line 16. The pf-set y stopped to be active. Recall that 
v is active when the procedure stops. Hence y had to be glued with other pf-sets 
to obtain v. This is, however, impossible because if y were glued with some other 
pf-sets to form a larger pf-set x then pred(y) = z C x. Notice that IiEzCiCd. 
We have got a contradiction with b v. 

Hence, there are no rules r in Q with head{r) G v and tail(r) g" v. Thus no atom 
in v is accessible so v C At(Q) \ LM{Q). 

(2) Suppose false returns the empty set and consider the last pass of the loop 6-23, 
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for size — \At(Q)\. If the list L is empty then no vertex of Q is an active pf-set. 
Hence, Q is a tree with the root {s}. Thus all atoms in At(Q) are accessible and 
consequently LM(Q) = At(Q). 

If the list L is nonempty then it contains one pf-set v = At(Q). The empty set 
is returned by the procedure false so the value of the variable success in line 16 
is true for v = At(Q). It means that for some rule r in Q with head(r) = u, 
w(u) = tail(r) v = At(Q) so w(u) — s. Hence, u is accessible and, consequently, 
all atoms in At{Q) are accessible. That is, we have At{Q) \ LM{Q) = 0. 

The converse of the implication proved above follows immediately from the first 
part of the theorem. □ 

We shall now consider the procedure cycle a little bit more carefully. The proce- 
dure can be informally written in the following form. 



procedure cycle(S, pred, size, L) 

1 . Initialize L to empty. 

2. Find all cycles C U C 2 ,...,C P in the graph Q. Put C = {Ci, C 2 , ■ C p }. 

3. For every cycle C — {v\, . . . ,v q }, C E C, do (i)-(iv). 

(i) set vc :=«iU...UD g ; 

(ii) compute cardinality(vc) (sum up the cardinalities of all vertices in C); 

(iii) update the set S: set S := (S — {v\, . . . , v q }) U {vc}; (* vc becomes 
an active pf-set *) 

(iv) update the function pred: for every i = 1, . . . , q, if pred(z) = Vi (for 
some ze5) then pred(z) :— vc] 

4. For every vertex of Q that is an active pf-set, if cardinality(v) — size, insert v 
into the list L. 

Since Q is a directed graph whose connected components are either unicyclic 
graphs or trees, step 2 of the procedure cycle can be implemented in 0(|<S|) time. 
Since pf-sets are represented as linked lists, with each node on the list pointing to 
the head of the list, step (i) can be implemented to take (9(|wc|) steps. The time 
needed for step (ii) is, clearly, 0(\C\). Each execution of step (iii) takes also 0(|C|). 
Finally, the running time of each execution of step (iv) is O(mc), where mc is the 
size of the connected component of the graph Q containing C. Thus, an iteration 
of the loop 3 for a cycle C e C takes 0(\C\ + mc + \vc\)- Clearly, |C| < mc- 
Moreover, Y,cec m c < |<5| - 1 < \At(Q)\ and Y^cec \ v c\ < \ At iQ)\ (they are all 
disjoint subsets of At(Q)). Thus, the total time needed for the loop 3 is 0(| ^4<(Q) |). 
It is easy to see that the time needed for the loop 4 is also 0{\At(Q)\). Consequently, 
the running time of the procedure cycle is 0(\At(Q)\). 

We are now in a position to estimate the running time of the procedure false. 

Lemma 4-2 

If the procedure false(Q) returns a nonempty set v, then the running time of false 
is 0(\v\ x |j4£(Q)|). If false(Q) returns the empty set then its running time is 
0(|^(Q)| 2 ). 

Proof: Let |^4f(Q)| = n and \v\ — k. As we have already observed the procedure 
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cycle runs in time 0(n). It is not hard to see that, since we represent all sets 
occurring in the procedure false as linked lists, with each node on a list pointing to 
the head of the list, the operations: findset and next require a constant time. 

First assume that the output v of the procedure false is nonempty. Let us estimate 
the number of passes of the while and for loops in the procedure. Clearly, the 
loop 6-23 is executed k times. Hence the total running time of all calls of the 
procedure cycle is 0(kn). The number of passes of the loop 9-22 is not larger than 
\L\\ + I-L2 + . . . + \Lk\, where Li denotes the list L in an iteration i of the loop. 
Since Li is a list of disjoint pf-sets of cardinality i, \Li\ < n, for each i — 1,2, ... ,k. 
Hence the number of passes of the loop 9-22 can be very roughly estimated by kn. 
The loop 12-20 is executed at most 



i=l veLi 

times. This inequality follows from the fact that the sets v in the lists Li are disjoint 
subsets of atoms so J2 v eL M — n - ^ ne estimation of the number of passes of the 
loop 14-18 is a little bit more complicated. First notice that in each execution of 
the loop we check a rule of the program Q and rules are checked only one time. 
The rules r checked in the loop have either both the head and the tail in some 
pf-sct v £ S or head(r) £ v and tail(r) is in some other pf-sct u £ S. In the 
latter case pred(v) is defined in line 16. The number of executions of line 16 is not 
larger than the number of passes of the loop 9-22 so it is bounded by kn. When 
the procedure returns the output, the pf-sets have cardinalities not larger than k. 
Hence the number of rules with both the head and the tail in the same pf-set that 
has been checked before the procedure stops is not larger than 



1) < (k-l)J2\ u \ ^ (k-l)n. 



Thus the number of passes of the loop 14-18 in the whole procedure false is less 
than 2kn. It follows that if the output v of false is nonempty then the running time 
of false is 0{\v\ x \At{Q)\). 

Now consider the case when the procedure false returns the empty set. Clearly 
the number of passes of the loop 6-23 is n so it takes 0(n 2 ) time for all executions 
of the procedure cycle. Since the rules are checked in the loop 14-18 only one time, 
the number of passes of this loop is not larger than the number m of rules in Q. 
Obviously m < n 2 so the running time of false in this case is 0(\At(Q)\ 2 ). □ 



By Lemma 4.2 and considerations in Section 3 we get an estimation of the running 



time of Algorithm 3. 
Theorem 4-3 

If P is a program whose rules have at most one positive atom in the body then 
Algorithm 3 can be implemented so that its running time is 0(\At(P)\ 2 + size(P)). 
□ 
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5 Conclusions 

The method for computing the well-founded semantics described in this paper is a 
refinement of the basic alternating-fixpoint algorithm. The key idea is to use a top- 
down search when identifying atoms that are false. Our method is designed to work 
with programs whose rules have at most one positive atom in their bodies (class 
CP\). Its running time is 0{\At(P)\ 2 + size(P)) (where P is an input program). 
Thus, our algorithm is an improvement over other known methods to compute the 
well-founded semantics for programs in the class CV\. Our algorithm runs in linear 
time for the class of programs P £ CP\ for which size(P) > |^4i(P)| 2 . However, it is 
not a linear-time algorithm in general. It is an open question whether a linear-time 
algorithm for computing the well-founded semantics for programs in the class CP\ 
exists. 

Our results extend to the class LVX . However, the extension is straightforward 
and the class LV^ is still rather narrow. Moreover, it is not specified syntactically 
(it is described by means of the Kripke-Kleene semantics). The question arises 
whether our top-down approach to positive-loop detection can be generalized to 
any class of programs significantly extending the class CP\ and possessing a simple 
syntactic description. 

Finally, let us note that the general problem of computing the well-founded se- 
mantics still remains a challenge. No significant improvement over the alternating- 
fixpoint algorithm of Van Gcldcr has been obtained for the class of arbitrary finite 
propositional logic programs. 
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